智能论文笔记

Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning

Sixing Yu , Arya Mazaheri , Ali Jannesari

分类：计算机视觉 | 机器学习

2021-02-05

模型压缩是在功率和内存受限资源上部署深神网络（DNN）的必要技术。但是，现有的模型压缩方法通常依赖于人类的专业知识，并专注于参数的本地重要性，而忽略了DNN中丰富的拓扑信息。在本文中，我们提出了一种基于图神经网络（GNNS）的新型多阶段嵌入技术，以识别DNN拓扑并使用增强学习（RL）以找到合适的压缩策略。我们执行了资源约束（即失败）通道修剪，并将我们的方法与最先进的模型压缩方法进行了比较。我们评估了从典型到移动友好网络的各种模型的方法，例如Resnet家族，VGG-16，Mobilenet-V1/V2和Shufflenet。结果表明，我们的方法可以通过最低的微调成本实现更高的压缩比，但产生了出色和竞争性的表现。

translated by 谷歌翻译

Auto Graph Encoder-Decoder for Neural Network Pruning

Sixing Yu , Arya Mazaheri , Ali Jannesari

分类：计算机视觉

2020-11-25

模型压缩旨在将深神经网络（DNN）部署在具有有限的计算和存储资源的移动设备上。但是，大多数现有模型压缩方法依赖于手动定义的规则，这些规则需要域专业知识。 DNN基本上是计算图形，其包含丰富的结构信息。在本文中，我们的目标是从DNNS结构信息找到合适的压缩策略。我们提出了一种自动图形编码器 - 解码器模型压缩（AGMC）方法与图形神经网络（GNN）和加强学习（RL）结合。我们将目标DNN模拟为图形并使用GNN自动学习DNN的嵌入物。我们将我们的方法与基于规则的DNN嵌入模型压缩方法进行了比较，以显示我们方法的有效性。结果表明，基于学习的DNN嵌入实现了更好的性能和更高的搜索步骤的压缩比。我们在过度参数化和移动友好的DNN上进行了评估方法，并将我们的方法与基于手工和学习的模型压缩方法进行了比较。在参数化DNN（如Resnet-56）上，我们的方法分别优于3.36 \％$ 4.36 \％$ 4.36 \％$ 4.36 \％$ 2.56 \％$ 2.56 \％的准确性。此外，在MobileNet-V2上，我们达到了比最先进的方法更高的压缩比，只需0.93±％$精度损失。

translated by 谷歌翻译

Improved Long-Form Spoken Language Translation with Large Language Models

Arya D. McCarthy , Hao Zhang , Shankar Kumar , Felix Stahlberg , Axel H. Ng

分类：自然语言处理

2022-12-19

A challenge in spoken language translation is that plenty of spoken content is long-form, but short units are necessary for obtaining high-quality translations. To address this mismatch, we fine-tune a general-purpose, large language model to split long ASR transcripts into segments that can be independently translated so as to maximize the overall translation quality. We compare to several segmentation strategies and find that our approach improves BLEU score on three languages by an average of 2.7 BLEU overall compared to an automatic punctuation baseline. Further, we demonstrate the effectiveness of two constrained decoding strategies to improve well-formedness of the model output from above 99% to 100%.

translated by 谷歌翻译

A Major Obstacle for NLP Research: Let's Talk about Time Allocation!

Katharina Kann , Shiran Dudy , Arya D. McCarthy

分类：自然语言处理

2022-11-30

The field of natural language processing (NLP) has grown over the last few years: conferences have become larger, we have published an incredible amount of papers, and state-of-the-art research has been implemented in a large variety of customer-facing products. However, this paper argues that we have been less successful than we should have been and reflects on where and how the field fails to tap its full potential. Specifically, we demonstrate that, in recent years, subpar time allocation has been a major obstacle for NLP research. We outline multiple concrete problems together with their negative consequences and, importantly, suggest remedies to improve the status quo. We hope that this paper will be a starting point for discussions around which common practices are -- or are not -- beneficial for NLP research.

translated by 谷歌翻译

Estimating the minimizer and the minimum value of a regression function under passive design

Arya Akhavan , Davit Gogolashvili , Alexandre B. Tsybakov

分类： (统计)机器学习

2022-11-29

We propose a new method for estimating the minimizer $\boldsymbol{x}^*$ and the minimum value $f^*$ of a smooth and strongly convex regression function $f$ from the observations contaminated by random noise. Our estimator $\boldsymbol{z}_n$ of the minimizer $\boldsymbol{x}^*$ is based on a version of the projected gradient descent with the gradient estimated by a regularized local polynomial algorithm. Next, we propose a two-stage procedure for estimation of the minimum value $f^*$ of regression function $f$. At the first stage, we construct an accurate enough estimator of $\boldsymbol{x}^*$, which can be, for example, $\boldsymbol{z}_n$. At the second stage, we estimate the function value at the point obtained in the first stage using a rate optimal nonparametric procedure. We derive non-asymptotic upper bounds for the quadratic risk and optimization error of $\boldsymbol{z}_n$, and for the risk of estimating $f^*$. We establish minimax lower bounds showing that, under certain choice of parameters, the proposed algorithms achieve the minimax optimal rates of convergence on the class of smooth and strongly convex functions.

translated by 谷歌翻译

An Ensemble-based approach for assigning text to correct Harmonized system code

Shubham , Avinash Arya , Subarna Roy , Sridhar Jonnala

分类：人工智能

2022-11-08

Industries must follow government rules and regulations around the world to classify products when assessing duties and taxes for international shipment. Harmonized System (HS) is the most standardized numerical method of classifying traded products among industry classification systems. A hierarchical ensemble model comprising of Bert- transformer, NER, distance-based approaches, and knowledge-graphs have been developed to address scalability, coverage, ability to capture nuances, automation and auditing requirements when classifying unknown text-descriptions as per HS method.

translated by 谷歌翻译

Road Rutting Detection using Deep Learning on Images

Poonam Kumari Saha , Deeksha Arya , Ashutosh Kumar , Hiroya Maeda , Yoshihide Sekimoto

分类：计算机视觉

2022-09-28

道路车辙是严重的道路障碍，可能导致早期和昂贵的维护成本的道路过早失败。在过去的几年中，正在积极进行使用图像处理技术和深度学习的道路损害检测研究。但是，这些研究主要集中在检测裂缝，坑洼及其变体上。很少有关于探测道路的研究。本文提出了一个新颖的道路车辙数据集，其中包括949张图像，并提供对象级别和像素级注释。部署了对象检测模型和语义分割模型，以检测所提出的数据集上的道路插道，并对模型预测进行了定量和定性分析，以评估模型性能并确定使用拟议方法检测道路插道时面临的挑战。对象检测模型Yolox-S实现了61.6％的Map@iou = 0.5，语义分割模型PSPNET（RESNET-50）达到54.69，精度为72.67，从而为将来的类似工作提供了基准的准确性。拟议的道路车辙数据集和我们的研究结果将有助于加速使用深度学习发现道路车辙的研究。

translated by 谷歌翻译

RDD2022: A multi-national image dataset for automatic Road Damage Detection

Deeksha Arya , Hiroya Maeda , Sanjay Kumar Ghosh , Durga Toshniwal , Yoshihide Sekimoto

分类：计算机视觉 | 人工智能 | 机器学习

2022-09-18

数据文章介绍了路线损坏数据集RDD2022，其中包括来自六个国家，日本，印度，捷克共和国，挪威，美国和中国的47,420条道路图像。图像已注释了超过55,000个道路损坏的实例。数据集中捕获了四种类型的道路损坏，即纵向裂缝，横向裂纹，鳄鱼裂纹和坑洼。设想注释的数据集用于开发基于深度学习的方法以自动检测和对道路损害进行分类。该数据集已作为基于人群传感的道路伤害检测挑战（CRDDC2022）的一部分发布。 CRDDC2022挑战邀请了来自全球的研究人员提出解决方案，以在多个国家 /地区自动道路损害检测。市政当局和道路机构可以使用RDD2022数据集，并使用RDD2022培训的模型用于低成本自动监测道路状况。此外，计算机视觉和机器学习研究人员可能会使用数据集对其他类型的其他基于图像的应用程序（分类，对象检测等）进行不同算法的性能。

translated by 谷歌翻译

Binary Iterative Hard Thresholding Converges with Optimal Number of Measurements for 1-Bit Compressed Sensing

Namiko Matsumoto , Arya Mazumdar

分类： (统计)机器学习

2022-07-07

压缩传感一直是依赖线性操作的非常成功的高维信号采集和恢复技术。但是，在存储或处理之前，必须对信号的实际测量进行量化。 1（一个） - 位压缩传感是压缩传感的大量量化版本，在其中，信号的每个线性测量都降低到一个位：测量的符号。一旦收集了足够的测量结果，1位压缩感应中的恢复问题旨在以尽可能准确的方式找到原始信号。恢复问题与学习理论中传统的“半空间学习”问题有关。为了恢复稀疏矢量，从1位测量值中的流行重建方法是二元迭代硬阈值（BIHT）算法。该算法是一种简单的投影次级下降法，尽管该问题的概念性不佳，但已知在经验上均能很好地收敛。 BIHT的收敛性属性在理论上没有合理的理由，除了大量的测量值（即，许多大于$ \ max \ {k^{10}，24^{48}，k^{3.5}/ k^{3.5}/ \ epsilon \} $，其中$ k $是稀疏性，$ \ epsilon $表示近似错误，甚至该表达式隐藏了其他因素）。在本文中，我们表明，BIHT算法仅通过$ \ tilde {o}收敛（\ frac {k} {\ epsilon}）$测量。请注意，这种依赖性对$ k $和$ \ epsilon $对于1位压缩传感中的任何恢复方法都是最佳的。据我们所知，BIHT是唯一需要所有参数（$ K $和$ \ epsilon $）中最佳测量值的实用和高效（多项式时间）算法。这也是在适当的结构条件下，梯度下降算法转化为非凸问题的正确解决方案的示例。

translated by 谷歌翻译

Community Recovery in the Geometric Block Model

Sainyam Galhotra , Arya Mazumdar , Soumyabrata Pal , Barna Saha

分类：机器学习

2022-06-22

为了捕获许多社区检测问题的固有几何特征，我们建议使用一个新的社区随机图模型，我们称之为\ emph {几何块模型}。几何模型建立在\ emph {随机几何图}（Gilbert，1961）上，这是空间网络的随机图的基本模型之一，就像在ERD \ H上建立的良好的随机块模型一样{o} s-r \'{en} yi随机图。它也是受到社区发现中最新的理论和实际进步启发的随机社区模型的自然扩展。为了分析几何模型，我们首先为\ emph {Random Annulus图}提供新的连接结果，这是随机几何图的概括。自引入以来，已经研究了几何图的连通性特性，并且由于相关的边缘形成而很难分析它们。然后，我们使用随机环形图的连接结果来提供必要的条件，以有效地为几何块模型恢复社区。我们表明，一种简单的三角计数算法来检测几何模型中的社区几乎是最佳的。为此，我们考虑了两个图密度方案。在图表的平均程度随着顶点的对数增长的状态中，我们表明我们的算法在理论上和实际上都表现出色。相比之下，三角计数算法对于对数学度方案中随机块模型远非最佳。我们还查看了图表的平均度与顶点$ n $的数量线性增长的状态，因此要存储一个需要$ \ theta（n^2）$内存的图表。我们表明，我们的算法需要在此制度中仅存储$ o（n \ log n）$边缘以恢复潜在社区。

translated by 谷歌翻译